Towards spoken clinical-question answering: evaluating and adapting automatic speech-recognition systems for spoken clinical questions

Authors

  • Feifan Liu
  • Gökhan Tür
  • Dilek Z. Hakkani-Tür
  • Hong Yu
Abstract

OBJECTIVE To measure how well existing automatic speech-recognition (ASR) systems interpret spoken clinical questions, and to adapt one ASR system to improve its performance on this task.

DESIGN AND MEASUREMENTS The authors evaluated two well-known ASR systems on spoken clinical questions: Nuance Dragon (both the generic and medical versions: Nuance Gen and Nuance Med) and SRI Decipher (the generic version, SRI Gen). The authors also explored language model adaptation using more than 4000 clinical questions to improve the SRI system's performance, and profile training to improve the performance of the Nuance Med system. Results were reported using the NIST standard word error rate (WER), and error patterns were further analyzed at the semantic level.

RESULTS The Nuance Gen and Nuance Med systems yielded WERs of 68.1% and 67.4%, respectively. The SRI Gen system performed better, attaining a WER of 41.5%. After domain adaptation of its language model, the SRI system's WER fell to 26.7%, a relative improvement of 36%.

CONCLUSION Without modification, two well-known ASR systems do not perform well in interpreting spoken clinical questions. With a simple domain adaptation, one of the ASR systems improved significantly on the clinical question task, indicating the importance of developing domain/genre-specific ASR systems.
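The abstract reports results as word error rate and as a relative improvement. The sketch below illustrates how those two quantities are commonly computed: WER as the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words, and relative improvement as the fractional drop from the baseline WER. It is a minimal illustration, not the authors' scoring pipeline; NIST scoring tools apply text normalization that is omitted here, and the function names and the example question are hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(baseline_wer: float, adapted_wer: float) -> float:
    """Fractional decrease of the adapted WER relative to the baseline."""
    return (baseline_wer - adapted_wer) / baseline_wer


if __name__ == "__main__":
    # Hypothetical spoken question and a hypothetical misrecognition.
    reference = "what is the recommended dose of warfarin"
    hypothesis = "what is the recommended does of war far in"
    print(f"WER: {word_error_rate(reference, hypothesis):.3f}")

    # WER figures taken from the abstract (SRI Gen before and after adaptation).
    print(f"Relative improvement: {relative_reduction(41.5, 26.7):.1%}")
```

Because insertions count as errors while the denominator is the reference length, WER can exceed 100% when the recognizer outputs many spurious words.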

Similar resources

Study on Spoken Interactive Open Domain Question Answering

This paper proposes an interactive approach to spoken interactive open-domain question answering (ODQA) systems. The goal of ODQA systems is to extract an exact answer to a user's question from unstructured information sources such as large text corpora. When the reliabilities for answer hypotheses obtained by an ODQA system are low, systems need more information to effectively distinguish the ex...

GeoVAQA: A Voice Activated Geographical Question Answering System

In this paper we present GeoVAQA, a Restricted Domain Spoken Question Answering system in the scope of the Spanish geography. The system consists of a web-based application that allows speech input questions about Spanish geography and sends back a concise textual answer. In our system, spoken questions are recognised by an automatic speech recognition (ASR) system. We have used RAMSES, a Spanis...

Towards Speech-Driven Question Answering: Experiments Using the NTCIR-3 Question Answering Collection

We developed a method for producing statistical language models for speech-driven question answering, which recognizes spoken questions with high accuracy. Our method uses a target collection (i.e., a document set from which answers are derived) to extract N-grams, and adapts them to the question-answering task by way of frozen patterns typically used in interrogative questions. In addition, our...

A Speech Interface for Open-Domain Question-Answering

Speech interfaces to question-answering systems offer significant potential for finding information with phones and mobile networked devices. We describe a demonstration of spoken question answering using a commercial dictation engine whose language models we have customized to questions, a Web-based text-prediction interface allowing quick correction of errors, and an open-domain question-answe...

Integrating spoken dialog and question answering: the Ritel project

The Ritel project aims at integrating spoken language dialog and open-domain information retrieval to allow a human to ask general questions (e.g. Who is currently presiding the French Senate?) and refine her search interactively. This project is at the junction of several distinct research communities, and has therefore several challenges to tackle: real-time streamed speech recognition with v...

Journal:
  • Journal of the American Medical Informatics Association : JAMIA

Volume 18, Issue 5

Pages: -

Publication date: 2011